- 01. Introduction
- 02. OpenAI Gym: BlackjackEnv
- 03. MC Prediction: State Values
- 04. Implementation
- 05. Mini Project: MC (Parts 0 and 1)
- 06. MC Prediction: Action Values
- 07. Implementation
- 08. Mini Project: MC (Part 2)
- 09. Generalized Policy Iteration
- 10. MC Control: Incremental Mean
- 11. Quiz: Incremental Mean
- 12. MC Control: Policy Evaluation
- 13. MC Control: Policy Improvement
- 14. Quiz: Epsilon-Greedy Policies
- 15. Exploration vs. Exploitation
- 16. Implementation
- 17. Mini Project: MC (Part 3)
- 18. MC Control: Constant-alpha, Part 1
- 19. MC Control: Constant-alpha, Part 2
- 20. Implementation
- 21. Mini Project: MC (Part 4)
- 22. Summary